Enhancement of esophageal speech using formant synthesis
نویسندگان
چکیده
The feasibility of using the formant analysis-synthesis approach to replace the voicing sources of esophageal speech was explored. The voicing sources were generated by using inverse-filtered signals extracted from normal speakers. Pitch extraction was tested with various pitch extraction methods, then simple auto-correlation method was chosen. Special hardware unit was designed to perform the analysis-synthesis process in real-time. Results of a subjective test showed that the synthesized speech was significantly improved.
منابع مشابه
Analysis and treatment of esophageal speech for the enhancement of its comprehension
This paper resumes an analysis of esophageal speech, and the developing of a method for improving its intelligibility through speech synthesis. Esophageal speech is characterized by low average frequency, while the formant patterns are found to be similar of those of normal speakers. The treatment is different for voiced and unvoiced frames of the signal. While the unvoiced frames are hold like...
متن کاملComparison of formant enhancement methods for HMM-based speech synthesis
Hidden Markov model (HMM) based speech synthesis has a tendency to over-smooth the spectral envelope of speech, which makes the speech sound muffled. One means to compensate for the over-smoothing is to enhance the formants of the spectral model. This paper compares the performance of different formant enhancement methods, and studies the enhancement of the formants prior to HMM training in ord...
متن کاملA Novel Postfiltering Technique Using Adaptive Spectral Decomposition for Quality Enhancement of Coded Speech
Abstract: An adaptive time-domain postfiltering technique based on the synthesis LP filter factorisation is proposed. Information is gathered about the relation between the LP filter poles and formants for this factorisation. This technique shapes the main formant differently from the other formants. Pole locations representing the main formant are modified and optimum shaping constants for eac...
متن کاملIntegration of Rule-based Formant Synthesis and Waveform Concatenation: a Hybrid Approach to Text-to-speech Synthesis
This paper describes an approach to speech synthesis in which waveform fragments dynamically produced with a set of formant-based synthesis rules are concatenated with pre-stored natural speech waveform fragments to produce a synthetic utterance. While this hybrid approach was originally implemented as a tool for research into improved voice quality in formant-based synthesis, it has produced s...
متن کاملVowel Enhancement in Early Stage Spanish Esophageal Speech Using Natural Glottal Flow Pulse and Vocal Tract Frequency Warping
This paper presents an enhancement system for early stage Spanish Esophageal Speech (ES) vowels. The system decomposes the input ES into neoglottal waveform and vocal tract filter components using Iterative Adaptive Inverse Filtering (IAIF). The neoglottal waveform is further decomposed into fundamental frequency F0, Harmonic to Noise Ratio (HNR), and neoglottal source spectrum. The enhanced ne...
متن کامل